AITopics | tensor model

tensor model

Improving Group Fairness in Tensor Completion via Imbalance Mitigating Entity Augmentation

Ahn, Dawon, Jang, Jun-Gi, Papalexakis, Evangelos E.

arXiv.org Machine Learning, Jul-29-2025

Group fairness is important to consider in tensor decomposition to prevent discrimination based on social grounds such as gender or age. Although a few works have studied group fairness in tensor decomposition, they suffer from performance degradation. To address this, we propose STAFF (Sparse Tensor Augmentation For Fairness) to improve group fairness by minimizing the gap in completion errors of different groups while reducing the overall tensor completion error. Our main idea is to augment a tensor with augmented entities including sufficient observed entries to mitigate imbalance and group bias in the sparse tensor. We evaluate STAFF on tensor completion with various datasets under conventional and deep learning-based tensor models. STAFF consistently shows the best trade-off between completion error and group fairness; at most, it yields 36% lower MSE and 59% lower MADE than the second-best baseline.
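As a rough illustration of the "gap in completion errors of different groups" mentioned in this abstract, the sketch below computes a per-group mean squared error and the spread between the best and worst group. The function name `group_mse_gap` and the max-minus-min definition of the gap are assumptions made for illustration; this is not the STAFF objective, nor the MADE metric reported by the authors.

```python
import numpy as np

def group_mse_gap(y_true, y_pred, groups):
    """Return per-group MSE and the largest gap between any two groups.

    y_true, y_pred: values of the observed tensor entries and their predictions.
    groups: group label (e.g. a demographic attribute) attached to each entry.
    """
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    groups = np.asarray(groups)

    per_group = {}
    for g in np.unique(groups):
        mask = groups == g
        per_group[g.item()] = float(np.mean((y_true[mask] - y_pred[mask]) ** 2))

    values = list(per_group.values())
    gap = max(values) - min(values)  # hypothetical fairness gap: worst minus best group error
    return per_group, gap

per_group, gap = group_mse_gap(
    y_true=[1.0, 2.0, 3.0, 4.0],
    y_pred=[1.0, 2.0, 2.0, 2.0],
    groups=["a", "a", "b", "b"],
)
print(per_group, gap)
```

A smaller gap at a similar overall error would indicate a better trade-off in the sense the abstract describes.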

artificial intelligence, group fairness, machine learning, (17 more...)

arXiv.org Machine Learning

2507.20542

Country:

North America > United States > Illinois > Champaign County > Urbana (0.14)
North America > United States > California > Riverside County > Riverside (0.14)
North America > United States > Illinois > Cook County > Chicago (0.05)
Africa > Senegal > Kolda Region > Kolda (0.04)

Genre: Research Report (0.82)

Industry:

Food & Agriculture (0.46)
Education (0.46)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Low-Rank Tensors for Multi-Dimensional Markov Models

Navarro, Madeline, Rozada, Sergio, Marques, Antonio G., Segarra, Santiago

arXiv.org Machine Learning, Nov-4-2024

This work presents a low-rank tensor model for multi-dimensional Markov chains. A common approach to simplify the dynamical behavior of a Markov chain is to impose low-rankness on the transition probability matrix. Inspired by the success of these matrix techniques, we present low-rank tensors for representing transition probabilities on multi-dimensional state spaces. Through tensor decomposition, we provide a connection between our method and classical probabilistic models. Moreover, our proposed model yields a parsimonious representation with fewer parameters than matrix-based approaches. Unlike these methods, which impose low-rankness uniformly across all states, our tensor method accounts for the multi-dimensionality of the state space. We also propose an optimization-based approach to estimate a Markov model as a low-rank tensor. Our optimization problem can be solved by the alternating direction method of multipliers (ADMM), which enjoys convergence to a stationary solution. We empirically demonstrate that our tensor model estimates Markov chains more efficiently than conventional techniques, requiring both fewer samples and parameters. We perform numerical simulations for both a synthetic low-rank Markov chain and a real-world example with New York City taxi data, showcasing the advantages of multi-dimensionality for modeling state spaces.
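To make the parameter-count claim concrete, the sketch below compares three ways of storing transition probabilities on a multi-dimensional state space. It assumes a CP-style (sum of rank-one terms) factorization with one factor matrix per tensor mode and ignores normalization constraints; the abstract does not specify the decomposition, so the counts and the helper name `parameter_counts` are illustrative assumptions rather than the authors' model.

```python
from math import prod

def parameter_counts(dims, rank):
    """Count free parameters for a Markov chain whose state is a tuple of len(dims) coordinates.

    dims: size of each state dimension, e.g. (10, 10) for a 10 x 10 grid.
    rank: rank used by the low-rank matrix and the (assumed) CP tensor model.
    """
    n_states = prod(dims)
    return {
        # Full transition matrix: one entry per (current state, next state) pair.
        "full": n_states ** 2,
        # Rank-R matrix factorization P = U V^T with U, V of shape (n_states, R).
        "low_rank_matrix": 2 * n_states * rank,
        # Rank-R CP tensor of order 2 * len(dims): one factor matrix per mode.
        "cp_tensor": 2 * sum(dims) * rank,
    }

counts = parameter_counts(dims=(10, 10), rank=5)
print(counts)  # {'full': 10000, 'low_rank_matrix': 1000, 'cp_tensor': 200}
```

The tensor count grows with the sum of the dimension sizes rather than their product, which is where the savings over the matrix model come from under these assumptions.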

decomposition, markov chain, tensor, (17 more...)

arXiv.org Machine Learning

2411.02098

Country:

North America > United States > New York (0.25)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain > Galicia > Madrid (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry:

Government (0.68)
Transportation > Passenger (0.66)
Transportation > Ground > Road (0.66)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

LoRTA: Low Rank Tensor Adaptation of Large Language Models

Hounie, Ignacio, Kanatsoulis, Charilaos, Tandon, Arnuv, Ribeiro, Alejandro

arXiv.org Artificial Intelligence, Oct-15-2024

Low Rank Adaptation (LoRA) is a popular Parameter Efficient Fine Tuning (PEFT) method that effectively adapts large pre-trained models for downstream tasks. LoRA parameterizes model updates using low-rank matrices at each layer, significantly reducing the number of trainable parameters and, consequently, resource requirements during fine-tuning. However, the lower bound on the number of trainable parameters remains high due to the use of the low-rank matrix model. In this paper, we address this limitation by proposing a novel approach that employs a low rank tensor parametrization for model updates. The proposed low rank tensor model can significantly reduce the number of trainable parameters, while also allowing for finer-grained control over adapter size. Our experiments on Natural Language Understanding, Instruction Tuning, Preference Optimization and Protein Folding benchmarks demonstrate that our method is both efficient and effective for fine-tuning large language models, achieving a substantial reduction in the number of parameters while maintaining comparable performance.
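The difference between per-layer low-rank matrices and a shared low-rank tensor can be sketched with NumPy. The block below builds LoRA-style updates for several layers, then builds updates of the same shape from a single CP-factored tensor whose modes are output dimension, input dimension, and layer index. That choice of modes, the sizes, and all variable names are assumptions for illustration; the abstract does not state which modes LoRTA actually uses.

```python
import numpy as np

rng = np.random.default_rng(0)
d_out, d_in, n_layers, rank = 64, 64, 12, 4  # hypothetical sizes

# LoRA: each layer gets its own pair (B, A), and the update is B @ A.
lora_B = rng.standard_normal((n_layers, d_out, rank))
lora_A = rng.standard_normal((n_layers, rank, d_in))
lora_updates = lora_B @ lora_A  # shape (n_layers, d_out, d_in)
lora_params = lora_B.size + lora_A.size

# Assumed tensor variant: one CP model shared by all layers.
# update[l, o, i] = sum_r U[o, r] * V[i, r] * S[l, r]
U = rng.standard_normal((d_out, rank))
V = rng.standard_normal((d_in, rank))
S = rng.standard_normal((n_layers, rank))
tensor_updates = np.einsum("or,ir,lr->loi", U, V, S)
tensor_params = U.size + V.size + S.size

print("LoRA trainable parameters:  ", lora_params)
print("Tensor trainable parameters:", tensor_params)
```

In this toy setting the LoRA count scales with `n_layers * rank * (d_out + d_in)`, while the shared tensor scales with `rank * (d_out + d_in + n_layers)`, so the rank can be tuned in finer steps for a given budget.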

arxiv preprint arxiv, matrix, trainable parameter, (15 more...)

arXiv.org Artificial Intelligence

2410.0406

Country:

North America > United States > Washington > King County > Seattle (0.14)
Africa > Senegal > Kolda Region > Kolda (0.04)
North America > United States > Pennsylvania (0.04)
Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)

Genre: Research Report > Promising Solution (0.34)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

On the Accuracy of Hotelling-Type Asymmetric Tensor Deflation: A Random Tensor Analysis

Seddik, Mohamed El Amine, Guillaud, Maxime, Decurninge, Alexis, Goulart, José Henrique de Morais

arXiv.org Machine Learning, Oct-28-2023

This work introduces an asymptotic study of Hotelling-type tensor deflation in the presence of noise, in the regime of large tensor dimensions. Specifically, we consider a low-rank asymmetric tensor model of the form $\sum_{i=1}^r \beta_i{\mathcal{A}}_i + {\mathcal{W}}$ where $\beta_i\geq 0$ and the ${\mathcal{A}}_i$'s are unit-norm rank-one tensors such that $\left| \langle {\mathcal{A}}_i, {\mathcal{A}}_j \rangle \right| \in [0, 1]$ for $i\neq j$ and ${\mathcal{W}}$ is an additive noise term. Assuming that the dominant components are successively estimated from the noisy observation and subsequently subtracted, we leverage recent advances in random tensor theory in the regime of asymptotically large tensor dimensions to analytically characterize the estimated singular values and the alignment of estimated and true singular vectors at each step of the deflation procedure. Furthermore, this result can be used to construct estimators of the signal-to-noise ratios $\beta_i$ and the alignments between the estimated and true rank-1 signal components.
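The deflation procedure described here (estimate the dominant rank-one component, subtract it, repeat) can be sketched for an order-three tensor as follows. This is a minimal illustration, not the authors' code: the rank-one step uses alternating power iteration with an initialization from the unfoldings' leading singular vectors, and the demo uses orthogonal components with weak noise, which is a simpler setting than the correlated components analyzed in the paper. The function names are hypothetical.

```python
import numpy as np

def rank_one_approx(T, n_iter=100):
    """Rank-one approximation of an order-3 tensor by alternating power iteration."""
    n1, n2, n3 = T.shape
    # Initialize y and z with the leading left singular vectors of the mode-2 and mode-3 unfoldings.
    y = np.linalg.svd(np.moveaxis(T, 1, 0).reshape(n2, -1), full_matrices=False)[0][:, 0]
    z = np.linalg.svd(np.moveaxis(T, 2, 0).reshape(n3, -1), full_matrices=False)[0][:, 0]
    for _ in range(n_iter):
        x = np.einsum("ijk,j,k->i", T, y, z)
        x /= np.linalg.norm(x)
        y = np.einsum("ijk,i,k->j", T, x, z)
        y /= np.linalg.norm(y)
        z = np.einsum("ijk,i,j->k", T, x, y)
        z /= np.linalg.norm(z)
    sigma = np.einsum("ijk,i,j,k->", T, x, y, z)  # estimated singular value
    return sigma, x, y, z

def hotelling_deflation(T, r):
    """Estimate r components one at a time, subtracting each rank-one estimate from the residual."""
    residual = T.copy()
    sigmas = []
    for _ in range(r):
        sigma, x, y, z = rank_one_approx(residual)
        sigmas.append(sigma)
        residual = residual - sigma * np.einsum("i,j,k->ijk", x, y, z)
    return np.array(sigmas), residual

# Demo: two orthogonal rank-one components plus weak Gaussian noise (assumed setup).
rng = np.random.default_rng(0)
n, betas = 20, [5.0, 2.0]
factors = [np.linalg.qr(rng.standard_normal((n, len(betas))))[0] for _ in range(3)]
signal = sum(
    b * np.einsum("i,j,k->ijk", factors[0][:, i], factors[1][:, i], factors[2][:, i])
    for i, b in enumerate(betas)
)
noisy = signal + 1e-3 * rng.standard_normal((n, n, n))
estimated, _ = hotelling_deflation(noisy, r=len(betas))
print("estimated singular values:", estimated)
```

With correlated components, each subtraction also removes part of the remaining signal, which is the effect the asymptotic analysis quantifies.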

artificial intelligence, machine learning, tensor, (17 more...)

arXiv.org Machine Learning

2310.18717

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
Europe > France > Auvergne-Rhône-Alpes > Lyon > Lyon (0.04)
(2 more...)

Genre: Research Report (0.50)

Industry: Banking & Finance > Economy (0.83)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

A Nested Matrix-Tensor Model for Noisy Multi-view Clustering

Seddik, Mohamed El Amine, Achab, Mastane, Goulart, Henrique, Debbah, Merouane

arXiv.org Artificial Intelligence, May-31-2023

In this paper, we propose a nested matrix-tensor model which extends the spiked rank-one tensor model of order three. This model is particularly motivated by a multi-view clustering problem in which multiple noisy observations of each data point are acquired, with potentially non-uniform variances along the views. In this case, data can be naturally represented by an order-three tensor where the views are stacked. Given such a tensor, we consider the estimation of the hidden clusters via performing a best rank-one tensor approximation. In order to study the theoretical performance of this approach, we characterize the behavior of this best rank-one approximation in terms of the alignments of the obtained component vectors with the hidden model parameter vectors, in the large-dimensional regime. In particular, we show that our theoretical results allow us to anticipate the exact accuracy of the proposed clustering approach. Furthermore, numerical experiments indicate that leveraging our tensor-based approach yields better accuracy compared to a naive unfolding-based algorithm which ignores the underlying low-rank tensor structure. Our analysis unveils unexpected and non-trivial phase transition phenomena depending on the model parameters, "interpolating" between the typical behavior observed for the spiked matrix and tensor models.

artificial intelligence, machine learning, tensor, (18 more...)

arXiv.org Artificial Intelligence

2305.19992

Country:

Africa > Middle East > Tunisia > Ben Arous Governorate > Ben Arous (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
(6 more...)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.48)

Optimizing Orthogonalized Tensor Deflation via Random Tensor Theory

Seddik, Mohamed El Amine, Mahfoud, Mohammed, Debbah, Merouane

arXiv.org Machine Learning, Mar-16-2023

This paper tackles the problem of recovering a low-rank signal tensor with possibly correlated components from a random noisy tensor, i.e., the so-called spiked tensor model. When the underlying components are orthogonal, they can be recovered efficiently using tensor deflation, which consists of successive rank-one approximations, while non-orthogonal components may alter the tensor deflation mechanism, thereby preventing efficient recovery. Relying on recently developed random tensor tools, this paper deals precisely with the non-orthogonal case by deriving an asymptotic analysis of a parameterized deflation procedure performed on an order-three and rank-two spiked tensor. Based on this analysis, an efficient tensor deflation algorithm is proposed by optimizing the parameter introduced in the deflation mechanism, which in turn is proven to be optimal by construction for the studied tensor model. The same ideas could be extended to more general low-rank tensor models, e.g., higher ranks and orders, leading to more efficient tensor methods with a broader impact on machine learning and beyond.

artificial intelligence, machine learning, optimizing orthogonalized tensor deflation, (12 more...)

arXiv.org Machine Learning

2302.05798

Country:

Africa > Middle East > Tunisia > Ben Arous Governorate > Ben Arous (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)

Genre: Research Report (0.50)

Industry: Banking & Finance > Economy (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Experimental observation on a low-rank tensor model for eigenvalue problems

Hu, Jun, Jin, Pengzhan

arXiv.org Artificial Intelligence, Feb-1-2023

Neural network-based machine learning methods are being rapidly developed for various numerical problems, such as physics-informed neural networks (PINNs) [10, 11, 12], the deep Ritz method [2], and the deep Galerkin method [15]. One of the advantages of these approaches is that they show the possibility of solving high-dimensional problems. In [2], deep learning techniques as well as Monte Carlo integration are used to solve eigenvalue problems, which provides a feasible strategy for high-dimensional cases. For the same eigenvalue problems, [16] applies a neural network-based low-rank tensor model, i.e., the tensor neural network (TNN), with a quadrature scheme to perform efficient numerical integration, and thus it achieves a much better result than [2]. Furthermore, [17] employs the TNN to solve the many-body Schrödinger equation, which demonstrates the practical value of such a low-rank approximation method.
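One reason a low-rank (separable) representation pairs well with quadrature is that a high-dimensional integral of a sum of products of one-dimensional functions factorizes into one-dimensional integrals. The sketch below shows this with Gauss-Legendre quadrature on the unit cube; it uses fixed analytic factors instead of neural networks, and the function name `separable_integral` is a hypothetical helper, so it illustrates only the integration idea and not the TNN method itself.

```python
import numpy as np

def separable_integral(factors, n_nodes=16):
    """Integrate f(x) = sum_r prod_i factors[r][i](x_i) over the unit cube [0, 1]^d.

    factors: list of rank-one terms, each a list of d vectorized 1-D callables.
    The cost is (number of terms) * d * n_nodes evaluations instead of n_nodes ** d.
    """
    nodes, weights = np.polynomial.legendre.leggauss(n_nodes)
    x = 0.5 * (nodes + 1.0)  # map Gauss-Legendre nodes from [-1, 1] to [0, 1]
    w = 0.5 * weights        # rescale the weights accordingly
    total = 0.0
    for term in factors:
        one_dim_integrals = [np.dot(w, phi(x)) for phi in term]
        total += np.prod(one_dim_integrals)
    return total

# Example in d = 10 dimensions: f(x) = prod_i cos(x_i) + prod_i x_i.
d = 10
factors = [[np.cos] * d, [lambda t: t] * d]
value = separable_integral(factors)
exact = np.sin(1.0) ** d + 0.5 ** d
print(value, exact)
```

A tensor-product grid with 16 nodes per axis would need 16**10 function evaluations for the same integral, whereas the separable form needs only a few hundred one-dimensional evaluations.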

artificial intelligence, machine learning, neural network, (20 more...)

arXiv.org Artificial Intelligence

2302.00538

Country:

Asia > China > Beijing > Beijing (0.05)
Africa > Senegal > Kolda Region > Kolda (0.04)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

On the Accuracy of Hotelling-Type Tensor Deflation: A Random Tensor Analysis

Seddik, Mohamed El Amine, Guillaud, Maxime, Decurninge, Alexis

arXiv.org Machine Learning, Nov-16-2022

Leveraging recent advances in random tensor theory, we consider in this paper a rank-$r$ asymmetric spiked tensor model of the form $\sum_{i=1}^r \beta_i A_i + W$ where $\beta_i\geq 0$ and the $A_i$'s are rank-one tensors such that $\langle A_i, A_j \rangle\in [0, 1]$ for $i\neq j$, based on which we provide an asymptotic study of Hotelling-type tensor deflation in the large dimensional regime. Specifically, our analysis characterizes the singular values and alignments at each step of the deflation procedure, for asymptotically large tensor dimensions. This can be used to construct consistent estimators of different quantities involved in the underlying problem, such as the signal-to-noise ratios $\beta_i$ or the alignments between the different signal components $\langle A_i, A_j \rangle$.

artificial intelligence, machine learning, tensor, (17 more...)

arXiv.org Machine Learning

2211.09004

Genre: Research Report (0.40)

Industry: Banking & Finance > Economy (0.85)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.94)

When Random Tensors meet Random Matrices

Seddik, Mohamed El Amine, Guillaud, Maxime, Couillet, Romain

arXiv.org Machine Learning, Jan-12-2022

Relying on random matrix theory (RMT), this paper studies asymmetric order-$d$ spiked tensor models with Gaussian noise. Using the variational definition of the singular vectors and values of (Lim, 2005), we show that the analysis of the considered model boils down to the analysis of an equivalent spiked symmetric block-wise random matrix constructed from contractions of the studied tensor with the singular vectors associated with its best rank-1 approximation. Our approach allows the exact characterization of the almost sure asymptotic singular value and alignments of the corresponding singular vectors with the true spike components, when $\frac{n_i}{\sum_{j=1}^d n_j}\to c_i\in [0, 1]$ with $n_i$'s the tensor dimensions. In contrast to other works that rely mostly on tools from statistical physics to study random tensors, our results rely solely on classical RMT tools such as Stein's lemma. Finally, classical RMT results concerning spiked random matrices are recovered as a particular case.
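For readers unfamiliar with the variational definition referred to in this abstract, it can be written out for the order-three case as follows. The notation ($\mathcal{T}$, $x$, $y$, $z$, $\sigma$) is chosen here for illustration and is not taken from the paper; the order-$d$ case adds one equation per mode.

```latex
% Singular value \sigma and unit singular vectors (x, y, z) of
% \mathcal{T} \in \mathbb{R}^{n_1 \times n_2 \times n_3}: stationary points of
%   \max_{\|x\| = \|y\| = \|z\| = 1} \mathcal{T}(x, y, z),
% where \mathcal{T}(x, y, z) = \sum_{i,j,k} \mathcal{T}_{ijk} \, x_i y_j z_k.
\begin{aligned}
\mathcal{T}(\cdot, y, z) &= \sigma x, \\
\mathcal{T}(x, \cdot, z) &= \sigma y, \\
\mathcal{T}(x, y, \cdot) &= \sigma z, \\
\|x\| = \|y\| = \|z\| &= 1 .
\end{aligned}
```

The best rank-1 approximation of $\mathcal{T}$ is $\sigma\, x \otimes y \otimes z$ for the solution with the largest $\sigma$. Each left-hand side is a contraction of the tensor with two of the singular vectors, and contractions of this kind are what the equivalent block-wise random matrix is built from.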

alignment, singular value, tensor, (15 more...)

arXiv.org Machine Learning

2112.12348

Country:

Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
Africa > Middle East > Tunisia > Ben Arous Governorate > Ben Arous (0.04)
North America > United States > Minnesota (0.04)
(2 more...)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

A Random Matrix Perspective on Random Tensors

Goulart, José Henrique de Morais, Couillet, Romain, Comon, Pierre

arXiv.org Machine Learning, Aug-2-2021

Tensor models play an increasingly prominent role in many fields, notably in machine learning. In several applications of such models, such as community detection, topic modeling and Gaussian mixture learning, one must estimate a low-rank signal from a noisy tensor. Hence, understanding the fundamental limits and the attainable performance of estimators of that signal inevitably calls for the study of random tensors. Substantial progress has been achieved on this subject thanks to recent efforts, under the assumption that the tensor dimensions grow large. Yet, some of the most significant among these results--in particular, a precise characterization of the abrupt phase transition (in terms of signal-to-noise ratio) that governs the performance of the maximum likelihood (ML) estimator of a symmetric rank-one model with Gaussian noise--were derived on the basis of statistical physics ideas, which are not easily accessible to non-experts. In this work, we develop a sharply distinct approach, relying instead on standard but powerful tools brought by years of advances in random matrix theory. The key idea is to study the spectra of random matrices arising from contractions of a given random tensor. We show how this gives access to spectral properties of the random tensor itself. In the specific case of a symmetric rank-one model with Gaussian noise, our technique yields a hitherto unknown characterization of the local maximum of the ML problem that is global above the phase transition threshold. This characterization is in terms of a fixed-point equation satisfied by a formula that had only been previously obtained via statistical physics methods. Moreover, our analysis sheds light on certain properties of the landscape of the ML problem in the large-dimensional setting. Our approach is versatile and can be extended to other models, such as asymmetric, non-Gaussian and higher-order ones.
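The key idea of studying matrices obtained by contracting a random tensor can be tried numerically. The sketch below builds a symmetric rank-one spiked tensor with Gaussian noise, contracts it with a unit vector to obtain a symmetric matrix, and inspects that matrix's spectrum. The noise normalization (symmetrized i.i.d. Gaussian entries scaled by `1 / sqrt(n)`), the signal-to-noise value, and the function names are assumptions for illustration and may differ from the conventions used in the paper.

```python
import itertools
import numpy as np

def symmetric_gaussian_tensor(n, rng):
    """Order-3 Gaussian tensor made symmetric by averaging over all index permutations."""
    G = rng.standard_normal((n, n, n))
    perms = list(itertools.permutations(range(3)))
    return sum(np.transpose(G, p) for p in perms) / len(perms)

def contraction(T, u):
    """Contract the last mode of T with the vector u, giving an n x n matrix."""
    return np.einsum("ijk,k->ij", T, u)

rng = np.random.default_rng(0)
n, snr = 30, 5.0  # hypothetical dimension and signal-to-noise ratio

x = rng.standard_normal(n)
x /= np.linalg.norm(x)  # planted unit-norm spike
W = symmetric_gaussian_tensor(n, rng)
Y = snr * np.einsum("i,j,k->ijk", x, x, x) + W / np.sqrt(n)

# Contracting with the spike direction gives a spiked symmetric random matrix.
M = contraction(Y, x)
eigvals, eigvecs = np.linalg.eigh(M)  # eigenvalues in ascending order
alignment = abs(eigvecs[:, -1] @ x)

print("largest eigenvalue:", eigvals[-1])
print("alignment of top eigenvector with the spike:", alignment)
```

Lowering `snr` makes the top eigenvalue merge with the noise bulk of `M`, a matrix-level view of the kind of phase transition the abstract discusses.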

expression, tensor, tensor model, (17 more...)

arXiv.org Machine Learning

2108.00774

Country:

Africa > Middle East > Tunisia > Ben Arous Governorate > Ben Arous (0.04)
Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
(10 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)
